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1. Introduction and historical background of the 

PROBLEM 

We consider here a problem of finding the sharp estimate for the boundedness of an arbitrary 
Calderon-Zygmund operator in L'^{w), w G A2. In the 70s Hunt-Muckenhoupt-Wheeden 
found a wonderfully simple characterization of weights for which the Hilbert transform is 
bounded from L'^{w) to itself. The problem had prediction theory background, because for 
a scalar stationary stochastic process weight has a meaning of its spectral measure density, 
and the boundedness of the Hilbert transform has a meaning of positive angle between the 
Past and a Future of the process-a good property of the processes, sort of their regularity. 

As such the problem of the boundedness of the Hilbert transform has been already at- 
tacked by Helson-Szego and Helson-Sarason. They also obtained a chracterization for the 
boundedness of the Hilbert transform in a weighted L^. The answers were equivalent (of 
course) but totally different. Till nowdays nobody knows how to obtain directly Helson- 
Szego condition from Hunt-Muckenhoupt-Wheeden condition. What we are doing below 
has some very vague flavor of going in this direction. Notice that the Helson-Szego-Sarason 
approach was developed for p 7^ 2 by Cotlar-Sadosky in [8]-[T2]. In the 80s a new point of 
view was introduced by Sawyer |17], [38], his treatment was concentrated on positive oper- 
ators (the Hilbert transform is not one of them), and he introduced the test conditions: to 
check the boundedness of a certain class of (positive) operators it turned out to be sufficient 
to check the uniform boundedness on a (non-linear) family of test functions, usually a col- 
lection of characteristic functions of some sort. Simultaneously in the 80s David and Journe 
|18j . [19] built a theory of Calderon-Zygmund operators (here the Hilbert transform belongs) 
based on so-called Tl theorem. A closer look shows (but to the best of our knowledge no- 
body then made this closer look) that Tl theorem is exactly Sawyer's test conditions. The 
difference was that there was no weight (life is easier), this was Lebesgue measure theory, 
but the operators were not positive, but rather singular (life is harder). Later Tl theory 
was proved to be fine not only for Lebesgue measure, but still the measure should have 
some smoothness: this was done by Christ in [6]. At the end of the 90s a nonhomogeneous 
measures were included into Tl theory: see [S2]^[3S], [S5], [IS], [SU] . 
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At the same time at the beginning of 90s several important papers appeared, which 
showed how bounded is the Calderon-Zygmund (or maximal) operator if Hunt-Muckenhoupt- 
Wheeden conditions are satisfied. It was the return to the 70s but on a new turn of the spiral. 
The questions of sharp weighted estimates appeared and seemed to be interesting not only 
for their own sake but mostly because they were needed by a) multiparameter Harmonic 
Analysis, b) sharp and especially critical exponent estimates for certain elliptic PDE. The 
first sharp estimate was obtained by Buckley [3j, he proved that the L'^{w) — ?• L'^{w) norm 
of the Hardy-Littlewood maximal operator grows at most as the first degree of the so-called 
A2 norm [w]^,^ of the weight. The proof was not easy. Now there exists a proof due to Lerner 
which takes only several lines. For Calderon-Zygmund operators (namely for the Hilbert 
transform and such) R. Fefferman and J. Pipher [2T] got a linear estimate in terms of Ai 
norm of weight (it is another important characteristic, see below). 

Then people started to consider not Calderon-Zygmund operators, not positive operators 
of maximal type or potential type, but their models on dyadic lattice. The simplest and 
most well used singular dyadic operator is called Martingale transform (see, e. g., [1], |2], 
|1]): for it, the sharp linear estimate in terms of A2 norm [w]a^ of the weight was obtained 
by Wittwer in [5B] . The interesting feature of her proof was that she used as a template a 
two- weight Martingale transform estimate of |10] . Both |10] and [58] sue Bellman function 
proofs. These two things: the use of two-weight approach (notoriously difficult for Calderon- 
Zygmund operators), and the use of the Bellman function technique became the features of 
sharp weighted line of research. 

Explanation may be the following: if one wants a sharp estimate, one is in a paradoxical 
situation: one should not use the good properties of weights, but one must use them! The 
exit is like that: use the good property but only once. The rest of the proof should be 
working for very bad measures (weights). This is how nonhomogeneous Harmonic Analysis 
and two-weights estimates come into play probably. We will see this below. 

The Bellman proofs persisted, and in |13] the first for-real Calderon-Zygmund operator 
got a sharp estimate by the first power of the norm [wj^j. This was the Ahlfors-Beurling 
operator, and its sharp weighted estimate allowed the authors to solve a problem of Iwaniec 
on a borderline regularity of Beltrami PDE. 

The Hilbert transform turned out to be more difficult to treat, but in [H] Petermichl 
proved the linear estimate in terms of norm [w]a2 for the Hilbert transform as well. Then 
in [U] she did this for the Riesz transforms. The Ahlfors-Beurling operator is the averaging 
of Martingale transforms (see [20j), and this being established, the result of became the 
corollary of [58]. On the other hand, as Petermichl showed in [46], the Hilbert transform 
is the averaging of the next-in-complexity dyadic operators: dyadic shifts. There are many 
more and more complex dyadic shifts, the linear estimate for all of them in terms of [w]a2 
was shown recently in a very interesting paper of Lacey-Petermichl-Reguera [2B]. And then 
another proof appeared in almost impossibly simple and beautiful papers of Cruz-Uribe, 
Martell, and the first author [H], [15] and [7]. They used an extremely beautiful "formula" 
by Lerner [27] . 

So now we have a linear in terms of [w]a2 estimate for all dyadic shifts and all their 
averages, which is a subclass of quite smooth Calderon-Zygmund operators. However, the 
general Calderon-Zygmund operator is not a simple average of dyadic shifts. Moreover, 
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dyadic shifts have "depth" r, which is the measure of their complexity. It is easy to get an 
estimate of their norms exponential in r. But this is bad if we want to give a linear in [w]a2 
estimate for all Calderon-Zygmund operators. 

So we naturally come to the question to obtain a linear in [w]a2 estimate for all Calderon- 
Zygmund operators. We almost get it. The reader can see this below. 



2. Main results 

In what follows w is a weight in A2, which as we know means 

[w]a2 ■= sup{w)i{w~^)i < 00 , (2.1) 
I 

the quantity [w]a2 will be called the "norm" of the weight. Operator T will be always a 
bounded operator in L^(R'') with Lebesgue measure such that 

{Tf,g) = j K{x,y)f{y)g{x)dydx (2.2) 

for all nice /, g having disjoint supports. Here K{x^ y) denotes the kernel of the operator 
and it will be always Calderon-Zygmund (CZ) kernel. That means 

\K{x,y)\< 



F - y\ 



\K{x, y) - K{x', y) \ + \K{y, x) - K{y, x') \ < )f_ ""'jl :\x - x'\ < - y\ ■ (2.3) 

\x y\ z 

Notice that K does not define T uniquely, the identity operator and all operators of multi- 
plication on a bounded function have the same kernel K — 0. Anyhow, such operators are 

called Calderon-Zygmund operators (boundcdness in with respect to Lebesgue measure 
and abovementioncd properties of the kernel). So in our definition the identity is also a 
Calderon-Zygmund operator, but of course a non-interesting one. By T' we understand the 
corresponding transposed operator, its kernel is K{y,x). 

In what follows C, c with indices denote absolute constants and constants depending on 
d and e only. 

We are going to prove two main results. 

Theorem 2.1. ||r||L2(«;)^L2(«;) < ci[t(;]A2 + C2(||r|| + ||7"||l2(«;-1)->L2.°°(«;-1)) • 

Remarks. 1. Obviously ||T||j^2(-^-)^^2,oo(^) < ||T||i2(^-)_j.^2(^-). For all "interesting" Calderon- 
Zygmund operators also [w]a2 < c||T||i2(^)^^2(^). So the theorem gives a "formula" for the 
norm. 

2. It is a bit amazing what it "almost" says: if Calderon-Zygmund operator is of weak type 
(2,2) then it is of the strong type (2,2). Moreover, its weak type norm coincides (up to 
a constant) with a strong type norm! The last outrageous remark is basically true for all 
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"interesting" Calderon-Zygmund operators, if we agree to call interesting those for which 

[w]a2 < \\T\\l^(w)~^L^-'^(w)^- 

Our next result gives a pretty good (almost perfect) estimate of ||T||^2(-^)^j;^2,oo(^) (and 
thus of ||T||j;^2(^)^^2(^) by Theorem 12. ip . Here T is any operator. It is an abstract theorem. 
Along with A2 class we need Ai. The weight is said to belong to Ai if for any cube / 

{w)i < B Mw{x) . (2.4) 
The smallest B serving for all cubes / is called its Ai "norm": [u^Jai- 

Theorem 2.2. Let be any function on [1, 00), 0(t) > t. Let the operator T has the property 
that for any w & Ai 

||T||ii(^)^ii,oo(^) < ci4>{[w]aJ ■ (2.5) 

Then this operator satisfies 

\\T\\l^w)^L2-°-{w) < Ci0(c2Ha2) • (2.6) 
Let us combine this theorem with a remarkable result of Lerner-Ombrosio-Perez: 
Theorem 2.3. Let T be an arbitrary Calderon-Zygmund operator. Then 

\\T\\mw)-,L^,°-{w) < Ci0(Hai) , where (t){t) = tlog(l + t) . (2.7) 
We obtain for any Calderon-Zygmund operator 
Theorem 2.4. Let T be an arbitrary Calderon-Zygmund operator. Then 

\\T\\l2(w)^l^,o.{w) < ci[w]a2 log(l + Has) • (2.8) 
Combining this with our first Theorem 12.11 we get 
Theorem 2.5. Let T be an arbitrary Calderon-Zygmund operator. Then 

\\T\\lHw)^lHw) < ci[w]a2 log(l + Maz) • (2.9) 

Remarks. 1. This almost linear estimate should be replaced by a linear one. So far the 
obstacle is in Theorem! 



The plan of the paper: we first prove Theorem 12.11 It will take several sections. Then 
we prove Theorem 12. 2[ To prove Theorem 12. II we need two more theorems. Let us introduce 
some notations: T^-i denotes the operator Tw~^ considered on L'^{w~^dx), T^ denotes the 
operator Tw considered on L'^iwdx). Notice a simple isometric formula 

||T^-i : L\w-^) -> L^w^ = ||r|U2(^)^i2(^) . (2.10) 

For getting rid of inessential question we always think that our weight satisfies 

A < w < L, 
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for a very small positive A and a large L. But all estimates of norms we want to have 
independent of A and L and dependent only on things like [w]a2 and such. However, the 
assumption above allows to think that T is already bounded wherever we want-the goal is 
to find the bound. 

Looking through all cubes / we denote by the smallest constant such that 

\\Tw-^Xi\\h^^dx) < , and \\T^Xi\\l^n,-^dx) < ^x«^W • (2-11) 

Theorem 2.6. < \\T\\l2{w)^l'2'°<'{w) + ||^'||l2(«)-i)^l2.°o(«,-i) • 

Theorem 2.7. ||T||L2(^)^i2(^) < ci[w]a2 +C2^/K^. 

Obviously the combination of Theorems 12. 6[ 12.71 gives our first main result, namely. 
Theorem 12.11 



3. The beginning of the proof of Theorem 1277 



In what follows we use Nazarov-Treil-Volberg preprint [38]. We used the following fixed 
notations: 

dfx := dx , du := w dx . (3.1) 

Let / G L'^{fi),g G L'^{i^) be two test functions. We can think without the loss of 
generality that they have the compact support. Then let us think that their support is in 



[i, l]"^. Let V^, be two dyadic lattices of M''. We can think that they are both shifts of the 

Id 



same standard dyadic lattice V, such that [0, 1] G V, and that = V + ui,V = V + U2, 
where vectors coi,co2 have all of their coordinates in [— |, ;|]. We have a natural probability 
space of pairs of such dyadic lattices: 

n:= {{0:^,002) el-^-r} 

provided with probability P which is equal to normalized Lebesgue measure on [— ;j, |]^". We 
called these two independent dyadic lattices V^, V because they will be used to decompose 
/ G L'^{fi),g G L'^{i^) correspondingly. This will be exactly the same type of decomposition 
as in the "nonhomogeneous Tl" theorems we met [32]- [36]. We use the notion of operators 
, A J. (Notice that we will always keep the name / for cubes from V^, and we always keep 
the name J for cubes from !) By this we mean the following. In L'^{^) there is a subspace 
Hj of function supported on I and having constant values on each son of I and such that 
the average value of a function with respect to dfi is zero. The orthogonal projections 

A^* : L\fi) ^ (3.2) 

are mutually orthogonal. Similarly, the orthogonal projections 

A^L2(/i)^?^:; (3.3) 



are mutually orthogonal. 
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In particular, we will be often using equalities 

I I VeL2{/.),||Vlli2(^)=i I 

We always use the fixed notation 



■ ||l2(/,) , II ■ \\u '■= II ■ ||l2(,,) , (-, := (-, , (-, ■)u := (-, ■)l^{u) ■ (3.5) 



Let us rewrite (13 .4^ (with the change of to and of course / to J) as another equality 
which we will be using often 

E 11^:^/^11' = II E^^/^ii'= \Y.^fj^^''jm.?- (3.6) 

J J Vei2M,||V||.=i J 

One piece of notation: Given a cubes I G I we introduce 

Here c(J) denotes the center of the cube I. 
Here is our first lemma. 

Lemma 3.1. Let dn = w^^dx^dv = wdx, w G A2. Then for any cube I G and any 
collection of disjoint open cubes la, la C. I, we have 

Y.\^iAxi\iM]Ml.) < K^j^il) , (3.7) 

a 

with K = c[w]\^. 

Proof. It is easy to see that 

[PiMimM < c inf M{xiw-'){x) . 

So 

< c I [M{xiw~']\x) dwix) . 

The last quantity is bounded by c [wW^ w^^{I) by Buckley's theorem, see P|. We are done. 

□ 

Introducing pivotal constant K. Let us denote by K the smallest possible quantity in 
the right hand side of (13. 7p . We call this constant the pivotal constant. Let K = 100 i^. 

Let us introduce the following notations: Let dfi = w^^dx^dv = wdx, w G A2. Consider 
an arbitrary cube / G and call a dyadic cube I G stopping if it is a maximal cube 
such that 

[Viixi\idfx)]'u{I)>kfx{I). (3.8) 

Let us notice then 
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Lemma 3.2. Let dfi = w ^dx,di> = wdx, w G Let {la} denote stopping suhcuhes of 1. 
Then 

J2 f^iQ < If^in ■ (3.9) 

a 

The proof is obvious from f l3.8p . the choice of K and from Lemma [3.11 We can introduce 
the pivotal constant if we change n to v but in the case /i = w~^dx, v = wdx, w E A2 it will 
be the same c[w]^2 because of the symmetry [w]a2 = ['W^~^]^2- 

The rest is devoted to the proof of the following result from [38] . In this result measures 
fi, V are arbitrary, even the doubling property is not assumed. We use the notations 

[/i, v\a2 := sup(/x)/(i/)/ , where := . 

I K I 

Theorem 3.3. Let /i, v he two arbitrary measures on R"' satisfying the pivotal condition 
(13. 7p and its symmetric version with /i and v exchanged, both with constant K . Let T he an 
arbitrary Calderon-Zygmund operator. Let the following test conditions he satisfied as well: 

WT^^XiWl < K^Mn , WTuXiWl < K^HI) ■ (3.10) 

Then 

\\T : L2(/i) ^ L\u)\\ < Cov1^^;^+ Civ^ + C2v^. (3.11) 

The proof of Theorem 12.71 follows from Theorem 13.31 immediately if we take into con- 
sideration that we proved that the pivotal constant for fi = w~^dx, v = wdx, w E A2 is 
K = c[w]\^. Even though the above theorem is proved in |38] we repeat here the proof 
with some modifications. We do this to slightly simplify |38] and to make the roles of the 
contants involved in the proof completely transparent. 



4. The proof of Theorem 13.31 . The start 

Fix two test functions / G L'^{n),g G L'^{u) and consider 

A^(/),A,^(^). 

Also, let Jq denote the cube of of side-length 1 containing supp(/), the same about /q 
changing f to g and ^ to u. 



AW:=(/ /rf/i)X/^ A^(^7):=(/ 9du)Xi^,- 

It is easy to see that functions A'^(/), A^(/), J G T>^ are all pairwise orthogonal with 
respect to the scalar product (■, ■)^ of L'^i^jj). The same is true for A'^{f), Aj(/), I with 
respect to the scalar product (■, ■)i, of L'^{u). Thus, 
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/ = A'^(/)+ J2 A^(/)' ii/ii' = iiA"(/)iiJ+ E w^iifK- (4-1) 

Similarly, 

9 = A''i9)+ E ^ii9), \\9\\l = W{9)\\l+ E 11^/(^)11'- (4-2) 

These decompositions and the assumption (13.111) imply in a very easy fashion that we 
can consider only the case 

A^(/)=0, A^(^7) = 0. (4.3) 

In fact, iT^f,9)u = (T^f - h.^U)^9)u + (/^m / rfyu)(T^(x/,^), 6')^., and the second term is 
bounded by trivially by (I3.10p . Using (I3.10p one can get rid of I^^{9) as well. 

So we always work under the assumption (14. 3p . We pay the constant c^fK^ to use this 
assumption. Now, for simplicity, we think that /, 9 are real valued. The proof will consist of 
cutting the sum below into several subsums (there seems to be at least seven of them) and 
using the cancellations separately in those sums: 

(T,/,<7).= Mif),^ji9))u. 
4.1. Bad and good parts of / and g 

We use "good-bad" decomposition of test functions /, 9 exactly as this has been done in 
j32j . [34]-[36]. Consider two fixed lattices "D^, T>'^ (so we fixed a point in fi, see the notations 
above). Fix forever 5 as follows: 

6 = — ^— . (4.4) 

2{n + e) ^ ^ 

We call the cube / G T>^ bad if there exists J G T>^ such that 

|J| > |/|, dist(e(J),/) < 1{JY-H{IY . (4.5) 
Here e( J) is the union of boundaries of all sons of J. Similarly one defines bad cubes J G T>'^ . 

Definition. We fix a large integer r to be chosen later, and we say that / G T>^ is essentially 
had if there exists J G T>^ satisfying (14.51) such that it is much longer than /, namely, 
£(J) > 2"£(J). 

If the cube is not essentially bad, it is called good. 

Now 

/ = fbad + fgood, fbad '■= ^ A^/ . (4.6) 

/€!l"^,/is essentially bad 

The same type of decomposition is used for g: 

g = gbad + ggood, gbad-= E A5^5(. (4.7) 

J&V , J is essentially bad 
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4.2. Estimates on good functions 

We refer the reader to [32], [S1]^[SS] for the detailed explanation that it is enough to estimate 
\{Tfj_fgoodggood)u\- However, here we also give an explanation for the sake of completeness. 

{Tfif, g)u = (T^fgood, ggood)u + {T^fbad, ggood)u + (T^f, gbad)u ■ (4.8) 

We repeat here sketchingly the reasoning of [32], [31]-[36]- In [32], [3l]-[36] we proved the 
result that the mathematical expectation of ||/bad||/x, ||5'bad|UI| is small if r is large. In fact, 
the proof of this fact is based on the observation that the conditional probability 

¥{{ui, U2) en-. I is essentially bad | / G V^"} < r(r) ^0, r ^ 00 . (4.9) 

So we consider the following result as already proved. 

Theorem 4.1. We consider the decomposition of f to bad and good part, and take a bad 
part of it for every u = (001,002) G Q. Let E denote the expectation with respect to {Q,F). 
Then 

mfbadh) <e{r)\\f\\^, where e{r)^0, r 00 . (4.10) 

The same with g: 

HWgbadWu) < e{r)\\g\\^, where e{r) ^ 0, r ^ 00 . (4.11) 



Coming back to (14. 8 p we get 

\{T^j.f,g)u\ < \iT^ f good, g good) u\ + \\T^\\\\ fbad\\ tJ.\\ggood\\u + fi 1 1 gbad I U S 

\iT^fgood,ggood)u\ + 2C£:(r) || / || ^ || 51 1| ^, 

where C temporarily denotes \\T^\\L^{f_i)^L'^{u) (a priori finite, see our assumption \ < w < L 
above). Choosing r to be such that Ce{r) < j, choosing f,g to make\(T^f, g)u\ to almost 
attain C'||/||^||5'||!/, and taking the mathematical expectation, we get 

^'^ll/IUII^IU < mT^fgood,ggood)u\ 

for these special /, g. If we manage to prove that for all /, g 

\iTJgood,ggood)u\ < c{Vk+ /^)||/IUI|^?L V/ e L\fi),\fg G L\u), (4.12) 
then we obtain 

which finishes the proof of Theorem 13.31 
The rest is devoted to the proof of (14.121) . 



5. First reduction of the estimate on good functions (14.121) . The diagonal part. 
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5. First reduction of the estimate on good 
FUNCTIONS (14.121 ). The diagonal part. 



< 



So let lattices V^,V^ be fixed, and let f^g be two good functions with respect to these 
lattices. Boundedness on characteristic functions declared in (13. lip obviously implies 

\{T,/\'if,^^jgU < v^||a^;/||,||a:;^||.. (5.1) 

In fact, in the left hand side we have 2^^ x 2^ terms enumerated by sons of / and sons of J . 
Let sj, dj be two such sons, and A^/ = Cg on s/ and Ajgf = q on dj. Obviously, 

- /.(.,)V2 ' i^'^i - ^(d,)i/2 • 

Then 



V^x\mf )U^ji9)\\u . 

Therefore, in the sum (T^f,g)^ = ^^g^,^ jg^,^(T^A^, Ajgf)^ the "diagonal" part can be 
easily estimated. Namely, by (15.11) (below r is the number involved in the definition of good 
functions in the previous section, and we always have I G V^, J G V without mentioning 
this): 

J2 liT.A^fA'jgU < y^JfWMl- (5-2) 

2-*VI<l-'"l<2*|J|,dist(7,J)<max(|/|,|J|) 

6. Second reduction of the estimate on good 
functions (14.121 ). a piece of long range 



INTERACTION 

Let us consider the sums 

Si:= E |(r^A^/,A:;^;).|. (6.1) 

2-d'-|j|<i/|<|j|,dist(/,j)> e{j) 
S2:= |(T,A^/,A:;^).|. (6.2) 

2~*|/|<|J|<|-f|,dist(/,J)>£(7) 

They can be estimated in a symmetric fashion. So we will only deal with the first one. 
Lemma 6.1. Let \I\ < \ J\, dist(J, J) > i{J). Then 

K^-^W-^"'*'^! S (dist(J. J) +'/(/) + ,(,))... a(J)'^'K^)'^1A?/||JA;,||. , (6,3) 
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Proof. Let c be the center of cube I. We use the fact that / A^/ (i/i = to write 
{T,A'if,A''jg), = lMt)l^diy{s)K{t,s)A'if{t)Ay,g{s) = 



di^it) / dv{s){K(t,s) - K{c,s))A>if{t)A''jg{s) 
I J J 



Then one can easily see that 



\{T,A^J,A^jg\\<A / / J%^^\A^J{tmUs)Wit)dy{s)- (6.4) 



/ J J 



\t - s\ 



Now we estimate the kernel jf^p^^^^^^Xiit)Xj{.s) < A (dist(j,j)+ff)+^(j))''+^ ^^ing that |/| < 
I J|, dist(/, J) > £{J). On the other hand 

IIA^'/IUiM <M/)'/lA^/||„ \\A^gh.^^^<^^{jy/'\\A'^g\l. 

And the lemma is proved. □ 

Let us notice that Lemma 16.11 allows us to write the following estimate for the sum of 
f lOll (as usual / G r"^, J G V): 



oo 



S.<E2- E (d,st(/ j) + ff) + £(j))^.. ^w''''''^>'''ii^''^ii^ii^^^ii- 

n=o i,j-i{i)=2-^i{j) ^ V ' ^ y ' V 

(6.5) 

Or 

n=0 fceZ 7,J:^(/)=2-"+fc,^(J)=2'= ^ V ) ^ ^ 



(6.6) 



To estimate "the n, A;" slice 

/,J:^(i')=2-"+'=/(J)=2'= ^ V ' / / 

let us introduce the notations. 



Ky{t, s) := 7— f7 zWdTe' ^ > 0' ^' ^ 



Also 

(y+|t-s|)'^ 

Then 

dz/(s)ir2fc(t,s)(^(t)V'(s) . (6.7) 



^,k < [ dli(t) [ 



6. Second reduction of the estimate on good functions (14.121) . A piece of long range interactionlS 



Lemma 6.2. The integral operator f ^ J Ky{t, s)(f{t) dfi{t) is bounded from L'^{fi) to L'^{i') 
if [/i, u\a2 < oo (recall that this quantity is equal to sup/(-]u(yu)/(z/)// Its norm is hounded by 

Let us postpone the proof of this lemma, and let us finish the estimate of Ei using it. 
First of all the lemma gives the following estimate: 

By Cauchy inequality 

k k JeI?'',£(J)=2* 7eX>'',£(/)=2-"+* 



(Eii^:^^ii')'^'(Eii^'/iiy'^'^ii/iuii^ 

by dUD. Then §M) gives Si < Er=o 2"" Efc and so 

oo 

Si < c [/i, u]'/^ E 2-"ll/IUII^?L = c [/i, i^]]l!\\fU9 



n=0 



and our first piece of long range interaction sum Si is finally estimated. 
Proof of Lemma 16.21 

Let us consider several other averaging operators. One of them is 

Iifi{s) := j - t)ip{t) dfi{t) . 

Another is as follows: let G be all cubes £k, k = {ki, ka) of the type [2ki, 2ki + 2] x ■ ■ ■ x 
[2kd, 2kd + 2], fcj G Z. Consider 



Consider also shifted grid G{x) = G + x,x G [0, 2)*^, and corresponding Ag{x)- 
Notice that 

I(p{s)<a / AG(x)^{s)dx. (6.8) 







In fact, consider [0,2^, ^dx as an obvious probability space of all grids G{x). Then it is 
easy to see that for every s the unit cube [s — s+ |]'^ is (with probability q > 0) a subcube 
of one of the cubes of G{x). Then the above inequality becomes obvious. 
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On the other hand, the norm of operator Aq as an operator from L'^{^) to L'^{i') is 
bounded by c [yU, z^JY^^- In fact, if 4 = [2A;i, 2fci + 2] x ■ ■ ■ x \2kd, 2kd + 2], then 

\\Ag^\\1<Y.^I \v\dfi)Mh)<Y.^[ l¥^prf/i)K4)M4)< 

ck^^lAaV / \ip\'^dfl = c[fi,iy]A^\\f\\l. 

The same, of course, can be said about || Then (16.81) imphes that the norm of aver- 

aging operator / from to Iv^(z/) is bounded by c [fi, Let us call by Ir the operator 

of the same type as /, but the convolution now will be with the normalized characteristic 
function of the cube [— r, r]: 

iMs) := J X[-r,r]4s - t)v{t) dfi{t) . 

It is obvious that the reasoning above can be repeated without any change and we get 

\\lMl<c[f^,'y]]l!\\f\\l (6.9) 

To finish with the operator given hj f J Ky{t, s)(p{t) dfi{t) as an operator from L'^{fi) 
to L'^{i')i let us notice that (and this is a standard inequality for the Poisson-type kernels) 



Ky{t,s)\ip{t)\diJi{t) < c X]2-'=^(/,.2.|^|)(.) . 



fc=0 



Now Lemma 16.21 follows immediately from (16.91) and the last inequality. 



7. The rest of the long range interaction 

As always all J's below are in 'D'*, all J's below are in T)'^ . Consider now the following two 
sums. 

E IMf.^'jgU. (7.1) 

|/|<2-*|j|,/nJ=0 

E \{T,/\U,/YjgU. (7.2) 

\j\<2-'i-'\i\,ir\J=% 

They can be estimated in a symmetric fashion. So we will only deal with the first one. 
Notice that /, g are good functions. These means, in particular, that J, J, which we meet 
in (17. ip satisfy 

dist(J,9J) >£(J)^~^£(/)^ (7.3) 

where 5 was introduced in (14. 4p . This is just (14. 5 p for disjoint /, J with I not essentially bad 
(see the definition at the beginning of Subsection 14. ip . 



7. The rest of the long range interaction 
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Lemma 7.1. Let I, J be disjoint, < 2 ^i{J), and satisfy (17. 3p . Then 

(dist(/, J) + £(/)+£( J)) 



Proof. If dist(/, J) > i{J), this has been aheady proved in Lemma \QA] And actually it was 
proved there with a better numerator: So let dist(/, J) < i{J), I, J being disjoint. 

Repeating (16.41) one gets 

Now we estimate the kernel: < A ■ Therefore, 

\{T,A'if,A':,gU < A^-^^^^^ (7.5) 
We use ( 17. 3p to write 



i{J)J (£(/) + dist(/, J)+£(J)) 



d+e 



because we assumed dist(/, J) < £(J) and / is shorter than J. This inequality and (17. 5p 
finish the proof of the lemma. □ 

Let us notice that Lemma 17.11 allows to write the following estimate for the sum ai from 
(EH): 



oo 



(7.6) 

Or 

n=0 fceZ 7,J:£(/)=2-"+'=/(J)=2'= ^ V ) 7 I 

(7.7) 

To estimate "the n, fc" slice 

7',J:£(/)=2-"+'=,£(J)=2'= ^ V ' ^ / 
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let us use again the notations 
Also 



Ky{t,s) := 1/ > 0, t,sGR. 

Then 

<7„,fc < / dii{t) / diy{s)K2k{t,s)(p{t)^lj{s) . (7. 



Lemma 16.21 now gives as before the estimate of ai. First of all the lemma gives the 
following estimate: 

/gI"S^(/)=2-"+'= J<^V"J{J)=2^ 

By Cauchy inequality 

k k J^V ,l(J)=2l= /gI'M,£(/)=2-"+* 

(Eii^:^^ii')'^'(Eii^'/iP'^'^ii/iuN^ 

by dUD. Then gives ai < Er=o 2""'^' Efc o-„,fc, and so 

oo 
n=0 

and our long range interaction sum cti is finally estimated. Symmetric estimate holds for a2 
from (OD- 



Conclusion: if f,g are good, then the sum of all terms KTf^^A'^ f, Ajg)^\ such that either 

£(J) 



m. e [2-^ 2^ or / n J = has the estimate c [/i, z/]^^^ 



8. The short range interaction. Corona 

decomposition. 

As always all J's below are in V^, all J's below are in V^. 
Let us consider the sums 

p--= E {A>ifXAj9),. (8.1) 

|i'|<2-<^'-|J|,/CJ, ,/is good 



8. The short range interaction. Corona decomposition. 



17 



E {T,A'if,A^jg)^. (8.2) 

I J|<2-'*'-|J'|, JC-f, JGXI^ , Jis good 

They can be estimated in a symmetric fashion. So we will only deal with, say, the second 
one. It is very important that unlike the sums Sj, (Tj, this sum does not have absolute value 
on each term. 

Consider each term of r and split it to three terms. To do this, let J, denote the half of 
/, which contains J. And J„ is all other sons. Let / denote an arbitrary super cube of Jj in 
the same lattice: I G V^. 

We write 

{T.A^f, A'^jg)^ = (T,(x/, A^/), A'^jq), + {T,{xiA^f), A'^jg), = 

{T,{xi^AU-),A^g)^ + {A'if),jXUxi),^j9)u-{A^f),jXT^^^^^ 

Here (Aj/)^^/. is the average of A^/ with respect to /i over Jj, which is the same as value of 
this function on Jj (by construction A^/ assumes on / 2'^, one on /j, others on J„). 

Definition. We call them as follows: the first one is "the neighbor-term" , the second one is 
"the difficult term" , the third one is "the stopping term" . 

Notice that it may happen that / = /j. Then stopping term is zero. 

8.1. The estimate of neighbor-terms 

We have the same estimate as in Lemma [7.11 

|(T„(x,,.A?/),A5j)„|< 

Of course, ||x/„A^/||^ < || Aj/||^. So the estimate of the sum of absolute values of neighbor- 
terms is exactly the same as the estimate of ai in the preceding section. 

8.2. The estimate of stopping terms 

We want to estimate 



First of all, obviously 



Secondly, 



I(a^/).,/JI(t,(xa/),a:;, 

l(A^/).,/J< 



Ai'/IU 



\iT,ixi\,),A^jgU = \ixi\,X^^j9),\ < 



18 



This is the usual trick with subtraction of the kernel, it uses the fact that J AjQ dv = 0. We 
continue by denoting the center of /, by c(/j). Consider two cases: 1) dist(x, J) < 10£(/), in 
this case (we use that J is good) 



£{I)J dist(x,c(/i)) 



d+e 



2) dist(x, J) > 10£(J), in this case 



dist(x, " ^ J dist{x, c{Ii)Y+' ' 
We continue, using the definition above. 



< Au{J)y^A^g\\^[^Y\iX}\Jd^^) . 



Thus 

We now get the estimate of the stopping term: 



8.3. Pivotal property 



Let / G V^. Let {/«} be a finite family of disjoint subcubes of / belonging to the same lattice. 
We have called (at the beginning of the paper) the following property pivotal property: 

J^^-PiMixMMla) < /i(/) . (8.6) 

a 

Recall that in the case 

/i = w~^dx , u = wdx , G ^2 
the property (18. 6 p is satisfied with 



8. The short range interaction. Corona decomposition. 
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8.4. The choice of stopping cubes 



Fix a cube / G V^. Let us call its subcubes I G a stopping cubes if it is the first one (by 
going from bigger ones to the smaller ones by inclusion) such that 



Vj{xi\idfi)\ > 100 Kfi{I), z = 1,2, 
where K is the constant from ( I8.6p . 

Here is the place, where we use the pivotal property ( 18. 6^ : 

Theorem 8.1. If [i^v are arbitrary positive measures such that 
every I G T)^ 

laTii^ , I<Zl , lis maximal stopping 



is satisfied, then for 
(8.8) 



Proof. In fact, let {la} be a family of maximal stopping cubes inside / according to stopping 
criteria just introduced in (18.71) . Then 



100 K 



cubes {la} are disjoint subcubes of /, and so (18. 6p is used now: 



100 

a 

K . 1 
100 K^^ ' - 2^^ ' 



□ 



Definitions. 1. For any dyadic cube J, F{I) will denote its father. 

2. The tree distance between the dyadic cubes of the same lattice will be denoted by t(/i, 12). 
Of course t(J,F(J)) = 1. 

3. Stopping cubes of the same lattice will also form a tree. We will call it S. The tree 
distance inside S will be denoted by r(S'i, 5*2). Of course 



r(5i,52)<t(S'i,S'2) 



(8.9) 



8.5. Stopping tree 

In Section [8] we introduced the sum, which we are left to estimate: 



E (t,a^/,a:;<7).. 

I J|<2-*|/|,JC/,JGX'^,Jis good 



^.10) 
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Each term of r was decomposed into three terms. We recaU: let Jj denote the son of /, 
which contains J. And /„ is the union of other sons. Let / denote an arbitrary supercube 
of li in the same lattice: I G T)^. 

For a given / G V^, J C I, J E V^, J good, we write down the following sphtting 

{T,{xi^A^f),Ay,g), + {A'if),jXUXf),^j9)u-{A^f),jX^^^ (8.11) 

Here (A^/)^ /- is the average of A^/ with respect to fi over Jj, which is the same as value of 
this function on Jj (by construction A^/ assumes on / exactly 2'^ values, one on Jj, others 
on In). 

We called them as follows: the first one is "the neighbor-term", the second one is "the 
difficult term" , the third one is "the stopping term" . 

In what follows it is convenient to think that we consider our problem on the circle T 
rather than on the line. We want to explain how to choose J in a stopping terms above. 

Construction of the stopping tree S. We choose first / = Jq, where Jq is the unit cube of 
the lattice which contains the support of /. The choose its maximal stopping subcubes 
{/}. Just use the criterion f l8.7p from Subsection 18.41 Call each of these J's by the name S. 
In each S again find its maximal stopping subcubes {S}. Et cetera... . All cubes, which 
were thus built, we call "stopping cubes". They have their generation. Stopping cubes, as a 
rule, will be denoted by symbols with "hats" . 

To explain the choice of / in a stopping terms above we need the notations. If i? is a 
cube in we call Q^i the cube in one more dimension built on R as on its base. Sometimes 
we call L{R) its upper face. 

Notations. If G is a stopping cube, and S = {S}, S G is a collection of its maximal 
stopping subcubes (we call them stopping suns of S, there stopping tree distance to S is 
one: r{S, S) = 1), we call Og the collection of all cubes J from the lattice V^, such that the 
top side of the cube Qj built on J as on its base lies in the set fl^ := {Qg \ UsesQs)- In 
particular, S E Og, but its stopping suns are not in C^. 

The choice of / in a stopping terms above in (18. lip is as follows: let /, J be as above, 
namely J G I, J E V^, J good, J C Jj, where Jj is a son of /, we choose the first (and 
unique) stopping cube S such that Jj G Og. Then we just put I = S. 

Definition. Recall that the father of an cube / with respect to the tree of all dyadic cubes 
was called F{I). If G 5, then its father with respect to tree S will be always called from 
now on S. 

Let us introduce the sum of absolute values of the "stopping terms" of the sum r above 
(as always / G P^, J G V). 

\J\<2-^''\I\,JCl,JeV^,Jis good 
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To estimate it we can use f lS.Sp . Then (recall that Jj is the half of / containing J) 

t<Ar, r:= {^y\j7f^Y\i^^^^^ 

|J|<2-*|/|,JC/, Jis good ^ ^ 

Theorem 8.2. Let in the sum T above I means the smallest stopping tree cube containing 
li . Then 

T<c^\\fU\g\\u- 

Proof. Put 

|J|<2-'''-|/|,JC/,,^(/)=2'=,^(J)=2-"+* ^' 

Then abusing slightly the notations we denote the sons of / by Ji, /2, . . . . We get 

2<* 

i=l £(7')=2'= JC/i,£(J)=2-"+'= ^' 

Consider only Ji. By the Cauchy inequality the estimate will be 

a/2 



£(/)=2'= JC/i,f(J)=2-"+'= "^"^ JC/i,^(J)=2-"+'= 

The middle term is bounded by \Ph{Xi\i^ dfi)]'^h'{Ii)/ By (18 .Tp we get that the middle 

term is bounded by -\/lOO K. In fact, this was our choice of /, which ensures that Ii G Of, 
and so (18. 7p holds. 

Thus, the last expression above is bounded by (this is just the Cauchy inequality) 

lo^/K E iiA^/iu( E \\^j9\\iy^" < 

i(I)=2'' JCIi,e{J)=2-"+*' 

lo^i E mfwim E E \\^j9\\iy^'- 

£(7)=2* £{7)=2'-- JC/i,^(J)=2-"+'= 

As a result we get the estimate on rn,k- 

r.,.<iov^( E w^^f&^'i E \\^^9\\iy^"- 

Now it is obvious from the formulae for T and rn,k that 

n k 

But from the estimate above and the Cauchy inequality ^f,rn,k < 10 So we 

get Theorem 18. 2[ 

□ 
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9. Difficult terms and several paraproducts 

Let us recall /, g are good functions and that in the sum 

E Mf^^^jg).. (9.1) 

\J\<2-'i^\I\,J(lI,j£V^,J\s good 

we consider each term of r and split it to three terms. To do this, let /j denote the son of /, 
which contains J. And /„ is the union of other sons. Let S denote the smallest supercube 
of li in the same lattice: S G P^, S & S such that 

heOs, (9.2) 

where the family of cubes Os was introduced shortly after flS.lOp . (In other words S is the 
smallest stopping cube containing Jj.) 

We wrote 

(T,A^/, a;;^), = (t,(x/„a^/), a;;^), + (t,(x/A^/), a;;^), = 

{T,{xi.^^if).^''jg)u + {^'jf),jXTMs).^''j9)u - {^'jf) 

Here S is the smallest cube from the stopping tree S such that Jj G Os- Also here (Aj/)^^/. 
is the average of A^/ with respect to ^ over Jj, which is the same as value of this function 
on Jj (by construction A^/ assumes on I exactly 2^^ values, one on each son). 

The sum of absolute values of the first terms and the sum of absolute values of the third 
terms were already bounded by (coA/[yU, i']a2 + ci\/^)||/||^||(7||,y in the preceding sections. 
Middle terms were called "difficult terms" , and we are going to estimate the absolute value 
of the sum of all difficult terms now. This is the most difficult part of the proof. 

Let {Sjse^ denote the family of stopping cubes of all generations. In what follows the 
letter S is reserved for the stopping cubes. Recall that S also denotes the stopping cube, the 
father of S inside the stopping tree S. 

Notations. Let 5 G 5 be an arbitrary stopping cube. We denote by P^^Os ^^e orthogonal 
projection in L'^{fi) onto the space generated by {A^}, / G Os (we mean the images of 
these projector operators), and we denote by ^u,Os the orthogonal projection in -L^(i^) onto 
the space generated by {Aj}, J G Os, J is good (we mean the images of these projector 
operators) . 

We fix / G V^', it defines S eS (see (ESD), we look at terms 

(a^/),,,,(t,(x5),a:;^).. 

We can write each of the term {A'^ f) ^j.(T^{xs) , ^j9)u with fixed S and / G Os, J G Os as 
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The definition of ts . We collect all of these terms with / G Os,I G V'',J G Os, 
J G V, \J\ < 2~'^^\I\, J is good. The resulting sum is called r^. (In summation below 
we should remember that /, g are good: so we can sum over all pertinent pairs of /, J 
remembering that some of A's are zero anyway.) 

We first fix good J, then summing over such J's gives (such J's should contain J, and 
they form a "tower" of nested cubes, from the smallest one called i{J) to the largest one 
equal to S; notice that the summing of quantities {/\^ip) over such a "tower" results in the 
average over the smallest cube minus the average over the largest cube of the "tower", the 
latter one being zero in our case because the /i-average over S of any A^(/) with L C S* is 
zero; we are dealing only with such L's now, as L's are in Os by our definition of ts above). 

(^A.,Os/)^./{j)(AjT^(Xs),Pv,Os^)^, 
where /(J) G Os,l{J) G V^,i{l{J)) = T(\J). 

9.1. First paraproduct 

Let us introduce our first paraproduct operator 

levt^jeOs Jev,j£Os,Jci,e{J)=2~'-e{i),jis good 
Then the absolute value of the sum rg above is 

\{^T,xsP,,Osf,K,Os9U < Ci \\P,,OsfUK,Os9\\u, (9.3) 
where Ci is the norm of t^t^xs operator from L'^{^) to LP'{i')- 

Theorem 9.1. The norm of operator ttt '^^ '^'^ operator from L'^ifi) to L'^{i') is bounded 
byc{VK+^^). 

Proof. Obviously, by orthogonality 
where 

aj:= Y \\^jT,iXs)\\l ■ 
Je<s>ii) 

and $(/) := {J : J eV^J e Os, J C I, i{J) = 2-"£(/), J is good} 

The Carleson imbedding theorem (see and in this context [32j) says that the bound- 
edness of the sum X]/gdm /eos I(v')m,^P'^^ 4C||v9||^ follows from the following Carleson 
condition 

yieV>',IeOs Y «^<C'/x(/) (9.4) 

eevi^/eOs/ci 
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Of course if we put ^(/) := {J : J e , J e C5, J C /, | J| < 2-'^^|/|, J is good} we notice 
that 

a,= \\^'jTMs)\\l=\\ Y ^jUxs)\\1. 

By duahty then 

Y sup \ Y iT^{Xs).^''A)u? < 

sup I Y iT,ixs\i),A''j^/jU'+\\T,ixi)\\l- 

So (I3.1ip imphes 

Y sup \ Y iT^iXs\i),Aj^U' + K^fi{r). (9.5) 

Let us consider the term (T^{xs\i), ^j4')u, J € ^(-^)- Exactly this quantity was estimated 
in dH^D- We get 

\{T,{xs\i),Ay,^U < Au{jy/^A^^\\^(^^y%j{xs\i)df^^ 
So the first term in (19.51) is bounded by (we use the Cauchy inequahty) 

E (§^)'[^/(X5v)rf^]M^)<E2""^ E [Vi{xs\i)df^]MJ) = 

J: JG'fC/) ^ ^ ^ ^ n e{J)=2-"e{I),JCl 

n 

as WipWu = 1. It is time to use the fact that I E Os, which means that the stopping criterion 
f l8.7p is not yet achieved on /, in other words that 

[Pi{xs\i)dfi]Ml)<^OOKfi{I). 

Combining this with (19. 5 P we get (19. 4p : 

Y a,<c{K + K^)^{I). 

And Theorem 19. H is proved as the norm (as we already remarked) of our paraproduct operator 
is the square root of 4 times the constant in the previous inequality. 

□ 
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Let us recall that we introduced above the definition of ts, for stopping cube S. We 
finished the estimate of the sum of ts over all stopping 5* (recall that the set of all, stopping 
cubes was called S): 

J2^s < c(v^+ J2 \\P,,Osf\\,\K,Os9\\u < c(v^+ (9.6) 

the last inequality following from the orthogonality of P^^Osf different S (the same for 
^u,Os9) the Cauchy inequality. 

9.2. Careful bookkeeping: two more paraproducts 

Definition. Similarly to fu,Os defined above we define ^u,Qs and fu,Qs\Os projections on 
the sum of Aj with good J such that J lies in Qs and Qs \ Os correspondingly. 

In the previous subsection we have estimated a piece of the sum of the difficult terms 

{A>if),j^{T,ixs),^jg)u, (9.7) 

namely, we estimated the sum of such terms, when J, J lie both in the same family Os, 
where S E S (arbitrary stopping cube). Such a sum was called ts, and we just proved in 

dnSD that E5G5^5 < C{^/K+ V^)||/|U||^?||.. 

What is left is to estimate the sum of abovementioned terms when J G Os and / 
belongs to another Osi , where S, Si are both stopping cubes. As / is larger than J, we 
have to consider the pairs of stopping cubes, where S is strictly inside Si {Si is one or more 
generations higher in a stopping tree S than 5"). 

Let us recall that F{I) denote the father of / inside the standard dyadic tree. Let us fix 
J. Let Sj C Sj-i C Si = Iq be the whole (finite) sequence of stopping cubes of successive 
generations containing J. So Si^i is a father of Si in the stopping tree S. Hence, it is not 
true that Si^i = F{Si) in general! Notice also that (A^/)^ /. is the difference between two 
averages of / with respect to fi, one over Jj and one over its father I. It is easy to some up 
successive differences and summing all above mentioned terms with fixed J we get (omitting 
for brevity the common factor of the scalar product: A'^j{g)): 

{{f)li,F{Sj) - if) ti,F2iS,))Tf,XSj-i H \- {{f)ii,S,-i - {f)ii,F{S,^i))T^XSj-i + 

((/)m,F(S,_i) - {f)ti,F^{Sj-i))Ti,XSj-2 H ((/)m,5j_2) - {f)ti,F{Sj-2))T,iXSj-2 + 

••• + 

((/)m,f(52) - {f)^^,F^S2))T^,XSl H ^ - if) fi,F{Si))T^,XSi ■ 

Recall that we are working with /'s such that two last averages will be zero. Regrouping 
we obtain 



if) fi,F{Sj--,)T^XSj-2\Sj-i H \- if) fi,F(S2)Tfj.XSi\S2 
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and 

k k 

Let us consider the first sum and let us now collect all pertinent J's. Because in the 
first sum J e Osj and averages are over S with indices strictly smaller than j we obtain the 
following sum by collecting: 

SeS 

Let us consider the second sum and let us now collect all pertinent J's. We get 



s&s 

However, there is also 7Tg\f,g) because so far we collected all difficult terms such that 

JeOs, lieOs', S c S' , S S' , r{S, S')>1. 

But we have to collect also the difficult terms such that J and li are in the same Os but / 
is already not in it: 

J,Ii& Os 1 I E Og, S is the stopping father of S , that is the terms with li — S . 

This gives us terms (in the previous notations li = Sj, I = F{S.j). I = Sj. the last equality 
is just exactly how we chose / in the definition of difficult terms, these are situations when 
we do not have stopping terms, they vanish) 

i{f)ij.,ii - if) f^,i)Tf,XSj = {{f)f^,Sj - if) ix,FiSj))Tij,XSj ■ 
Collecting we obtain 

4\f:9) ■■= J2(f)s{T,XS:K,Os9) - J2{f)F(S){T,XS:K,Os9) ■ 

SeS SeS 

Now we can consider sum of all difficult terms p — 'n's\f,g) + '^s\f,g) + '^s\f,9) by 
uniting the sum -J2ses{f)F{S)iTiJ.Xs,K,Os9) with 'KP{f,g), and then uniting the result 
with (/,^). The sum J2sesif)s{TixXs,'^u,Os9) stays alone: 

P = ^{f)^^,F(S){T^,Xs\s^'^'^,Qs9)u + ^{f)^,,siT^^Xs,K,Os9)u =■ Pi + p2 ■ 
ses ses 

We introduce now two paraproducts: 

^""f J2(f)^'SK,Os{T,Xs) , 
ses 
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^'^f--=J2(f),,Fis)^.,QsiT,Xs\s)- 

Then pi = {n'^ , g)^, p2 = {Tx'^^g)^,. So to finish the proof of our Theorem 13.31 it is enough 
to prove the boundedness of these paraproducts as operators from L'^{fi) to Ll^iv) with the 
estimate of norm < c {y/K + ^JK^). 

To prove the boundedness of the first paraproduct let us use Theorem 18. 1[ Consider the 
sequence 

{hs]ses. hs := \K^Os{T,Xs)\\l ■ 

It is a Carleson sequence: 

yieV^" J2 bs<cK^fi{I). (9.8) 

sci,S€S 

In fact, bs < \\Tf,xs\\l < K^^i{S) by ^M)- Now ^M) becomes clear by Theorem Ell 

Notice that "^v^Os mutually orthogonal projections in LF'{i') for different 5*. This is 
just because the families Os are pairwise disjoint for different S & S. This is exactly what 
helped us to cope with n'^ f so easily, we just used 

h'^fWl = II Y.^f),,s^u,Os{T,xs)\\l = Y.\^f),,s?Wu,Os{T,xs)\\l = Y.\^f),,s?as. 
ses 5e<s ses 

This is where the orthogonality has been used. And we applied then the Carleson property 
of {bs}s£S- We already saw this type of paraproducts with the property of orthogonality 
(see [22], [32], and especially Theorem 19.11 above). And we know that Carleson condition 
(19. 8p is sufficient for the paraproduct operator vr*^ to be bounded with constant l^cK^. 

The second paraproduct tt*^ is a quite different story because projections P^Qs , 5* G 5 are 
not orthogonal. 



9.3. The second paraproduct tt^. 

So ||vr'5/||^ has the diagonal part but also the out of diagonal par: 

||vr^/||^ < DP + ODP, 

where 

DP--=Y.\(f)^^ns)\'\K,QsUXs\s)\\l, 

ODP:= \if),,FiS') 
s,s'es,s'cs,s'j^s 

|(/)/.,F(5')II(/)m,F(5)||(P!.,Qs,7;(X5\5)'P!^,Qs'^m(X5'\5')i^I • 

S,S'eS,S'cS,S'^S 
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We start with ODP. Recall that r = r{S\S) is the generation gap between 5" and S*, 
5" C 5* in the stopping tree S. Choose a small Eq depending on e of Calderon-Zygmund 
assumptions: 

ODP < j2 i(/).,n5)nip.,Q,,T,(x^^,)ii^ ■ (1 +5o)^(^'")+ 

s,s'es,s'cs,s'j^s 

E l(/)M,i^(50l1P.,Q..r.(X5'\5')f.-(l + ^o)-^(^'")< 

S,S'&S,S'cS,S'j^S 
oo 

Y.\{f),,FiS)\'Y.(l+eoy l|P.,Qs'^.(X5\5)ll'+^(^0)5^|(/),,F(5)n|P.,Q.r,(x 
SeS j=l S'&S,S'cS,r{S',S)=j sgs 

Now we need to estimate these sums 

^.:=EK/W)I' E \K,Qs'T,iXs\s)\\l J = 1,2,3,..., 

5e5 5'e<S,5'c5,r(5',5)=j 

By the way, = DP. We need to see that Fj are exponentially small. 

All such sums have the form of Carleson imbedding theorems. So we need to check 
countable number of Carleson conditions now. 

Carleson condition for Fj. We introduce the sequence 

as ■= \\^u,QsT^{Xs\s)\\l^ ^ S,r{S,S) = 1. 



And also 
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E \K,QsJ,iXs\s)\\l riS,S) = 1, J = 1,2,3, 

S'eS,S'cS,r{S',S)=j 



We will need the following Lemma. 

Lemma 9.2. Let S' ^ S G S be cubes ofV^. Let the tree distance between S' and S with 
respect to the tree satisfy t{S', S) > j, j = 0, 1, 2, ... Then 

\KAT,Xs\s)\\l < C2-^^/MS'){VsXs\sdf^?. 

Proof. Let \\ip\\u = 1- Let us consider the term (T^(x^y^), Aj-?/'),^, J G Qs'- Exactly this 
quantity was estimated in f l8.4p . We get 



So each our projection can be estimated as follows 

\K,QsT,{Xs\s)\\l<i'PsiXs\s)df^r E ^(J)(jS?)^^'- (9-9) 

Jgood,JcS' \ \ )/ 
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So \\¥^^s'{T^Xs\s\\l is bounded by 

oo 

t=j £{J)=2-H{S),JCS' 

which proves the lemma. □ 

We first estabhsh a Carleson property for {as}- Let / be in P^. 
We need to prove 

J2 \K,QsT,iXs\s)\\l < CKfiil) . (9.10) 

ses,F{S)ci 

Consider the family of our largest stopping cubes {Sa}a£A such that F{Sa) C /. We consider 
their father. We call it S abusing the notations slightly. 

Using our notations for father in the stopping tree S we can write 

Sa = S \/a e A. 

Notice that 

CPsAXsxFisJdfiyuiFiSa)) < lOOKfiiF{S^)) \/a eA. (9.11) 

But this is not true with replacing F{Sa) by SJ. Let us use naively (19. lip and Lemma Wl2[ 
Then we get 

a&A a&A aeA 

200 K J2 f^iHSa)) + 2J2\\ (T.XFis^) \\l < (200 K + K^)J2 KFiS^)) ■ 

a£A a£A a^A 

In other words we would like to conclude that 

E \K,QsT,{Xs\sJ\\l < c{K + K^)ii{I) . (9.12) 

But instead, by naive reasoning we achieved 

Y,\\^u,QsT,{Xs\sJ\\l<ciK + K^) J^KFiS^)). (9.13) 

aeA a£A 

This is a dangerous place because while the cubes Sa are pairwise disjoint, there fathers 
F(S'a)'s are usually not and we cannot deduce (19.121) from fl9.13p . as this is not guaranteed 
that 

$^Mi^(5«))<c/i(/)- 

We cannot use doubling. This last inequality actually is usually false. 
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However, fl9.12p is true. But the way to prove it is more subtle. Let us do it. Let {F^j/^gs 
denote the family of maximal cubes among {F{Sa)}a£A- Let for a given (3 E B the family 
{Si^^^} denote all cubes from {Sa}aeA that lie in Fg. Now 

/3GB 7 P&B 7 

For the second sum: 

7 7 
7 

by our Sawyer's type test assumption (I3.10p . Also we can use now the disjointness of Fp to 
conclude that 

S2 <4ir^^(/). 
For the first sum we use Lemma 19.21 to conclude 

^1 < EE('^^/3^5\f//^)^'^('5/3,7) < 

/3eB 7 

P&B I3&B 

We used here the disjointness twice. 

For the second sum we can use another estimate-via K, not K^-M we use Lemma 19.21 
again: 

^2 = Y.ll\\^'^^Qs,^T,XF,\s,M< 

l3eB 7 

EE(^%.^^A^^,.)^'")M^/3,7) <^E'"™ ^^M/)- 

peB 7 /3eB 

We used here (18. 6p . (We use it here for the second time in our proof, the first one was in 
Theorem 18.11 notice that the first method of estimate S2 does not require the use of (18. 6p . 
but instead involves constant K^, here we use only constant K, it may be important for 
something.) 

Finally (I9.12p is proved. But to prove the estimate of Carleson type for {a^j^g^ we need 
not just (KT2\i but 

E \\^'^^QsUXs\s)\\l < CK^iil) . (9.14) 

SeS,F{S)Cl 
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We estimated not the whole sum in (19.1 7p but only the sum over maximal S such that 
S E S, F{S) C /. In other words we estimated 

E \K,QsT,{Xs^\sJ\\l < CKf^il) . (9.15) 

Sa(^S,F{Sa)d,Sa is maximal 

But the standard reasoning shows that (19.151) is enough to prove (I9.17p ! In fact, if our S in 
the sum in (I9.17P is not maximal it is contained in a maximal one. Denoting by 5*^(0;) the 
maximal such S contained in we conclude 

j 

We sum over j and a and notice that our main stopping property says 

a 

This gives the sum over maximal cubes inside maximal cubes. Next generation of stopping 
cubes will give a contribution ^/i(/) because 

a j a 

yet next generation will come with the contribution |/i(/) et cetera... All this is because of 
Theorem 18. 1[ And we obtain (I9.17p . 
This gives 

DP = Fo<CK\\f\\l. (9.16) 
We are left to estimate ODP or rather to give an exponentially decaying estimates of sums 

9.4. Miraculous improvement of the Carleson property of the se- 
quence {a^s}ses 

We used Lemma 19.21 above. But we used it only with j = 0. Now we will be estimating 
Carleson constant for {a-^^ys^s and it should be exponentially small. We will use again 
Lemma [9.21 but with j > 0. Recall that r{S', S) denote the tree distance between these two 
cubes inside the stopping tree. We again consider I G V^, the smallest S E S containing I. 
We need now the estimate 

E E \K,QsMXs\s)\\l < C^2-V(J) . (9.17) 

SeS,F{S)Cl S'CS,r{S',S)=j 

We repeat verbatim the reasoning of the previous section, and of course 2'-'^/^ appears 
naturally from Lemma 19. 2[ We just use the fact that cubes S' involved in ^u,Qg, have the 
property 

tiS',S)>riS',S)>j. 
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The only place where one should be careful to get the extra 2 ^-^ is the estimate of S2. 
Now the estimate via will not work. We cannot use 

E w.AT,XF,\s,,x< 

2E E W.AT,XF,)\\l + 2Y,\\T,Xs,X < K^fi{Fp) 

anymore. Actually we can use it but this does not give extra 2^'^^. Instead, we use our 
second method of estimating S2 by Lemma 19.21 

7 S'cSB,j,riS',Sa,^)=j 

7 5'c5;3,-,,»-(5',S'^,^)=i 

C2-^^/' E K^^,7)(^S,.,XF,\5,„rf/i)' 
7 

For a fixed /3, the cubes Sp^^ are disjoint by their construction (see above). It is time to use 
(18. 6p . (We use it here for the third time in our proof, the first one was in Theorem 18. 11 the 
second time was the estimate of S2 for DP above via K.) If we apply (18. 6p to the last sum, 
we get 

Y,^i^^n)iPs^,,XF^\Sp,.ydf^f < Kfi{F(s) . 
7 

Therefore, 

E E \KAT,XF,\s,J\\l < c2-^V2;^/i(F,) . 

7 S'cSfi,^,r{S',Sp,^)=j 

We already said that all terms, in particular, the analog of the sum Si also get 2"-''^/^ factor. 
This is nice as we get 

E a^,<c2-^V2^(/). 

S&S,F{S)Cl,S is maximal 

Now we again need to estimates the whole sum 

E a^5<c2-^-^/V(/). (9.18) 
ses,F(S)ci 

This achieved exactly as before with the help of ( 18. 8p of Theorem 18.11 We consider 5*0, 
to be maximal S G S,F{S) C /, and then for a fixed a consider Si{a) to be maximal 
SeS,F{S) c Sa. 

Next generation of stopping cubes will give a contribution |2~''^/^/i(/) because 

EE'"(^^("))^^E'"(^-)^^M/), 

a i a 



1 0. The proof of Tlieorem \2.6[ An estimate of via the weak norm 
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yet next generation will come with the contribution j2 ■''^/^/i(J) et cetera... And we get 
(I9.18P . All this is because of Theorem 18.11 

The proof of Theorem 13.31 is finished at last. 



10. The proof of Theorem 12.61 . An estimate of 



VIA THE WEAK NORM 

We need to estimate the best constant in inequahties 

\\Txiw-Y^<Bw-\l). 

and 

\\T'xiw\\l.,<Bw{I). 

Recall the Lorentz space 

poo 

L''\w-') := {/ : := / {w-\x : \f{x)\ > t)y/'dt < 00} 

Jo 

By definition 

y;-l(/)V2 ^ ||x/||l2.1(«,-1) . 

Therefore the inequalities above can be rewritten in a different fashion: 

\\TXlU)~^\\w < \\Xl\\L2,i{w-i) ■ 

and 

The first one can be further rewritten as 

nTxiw"'^ 



-1 < Vb ||x/||l2.i(«,-i) • 



Therefore, if we denote by T he operator from L'^'^{w ^) to ^) acting by the rule 

Tfw-^ 



w ^ 



we obtain 

Tfi < ||T|| . 

Take a look at the adjoint operator (the duality is with respect to / • • • w~^dx). It is just 
/ ^ T'f from L2(w-1) to L'^'^ivj-^). Therefore 

< max(||T' : L\w-^) L^'°°{w-^))l \\T : L^w) L''~H||) 
Theorem 12.61 is completely proved. 

Combining this with Theorem 13.31 we see that Theorem 12.11 is already proved. 
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11. The proof of Theorem UM Extrapolation type 



APPROACH. 

This is an abstract theorem, T is arbitrary here such that 

suptiy({x : \Tf{x)\ > t)} < <P{[WU)\\f\\LHw) (H-l) 
t>o 

for any / G L^iW) and any W e Ai. 
We are going to prove that 

snptwiix : |T/(x)| > t})'/' < c0(MaJ||/1|l^h (11.2) 
t>o 

for any / G L?{w) and any w G A2, where c is depending only on dimension. 

Fix such a w. We can consider the case t = 1 only, this is just homogeneity of both parts 
of ([IL2D- Denote 

n:= {x: |T/(a;)| > 1}. 

We can write 

winf^ = sup I / hwdx\ . (11.3) 

Consider operator 

, _ M{fw) 



W 

where M stands for Hardy-Littlewood maximal function. Notice that S^f > /• 
By Buckley's theorem we know that 

\\S^:L'{w)^L\w)\\<c[wU,. (11.4) 

Using this fact we generate 

°o ok I. 

Then obviously Rh > h and 

S^Rh < 2\\S^\\Rh < c [w]a2 Rh, . 

Which, by definition of Ai means 

wRh G Ai , [wRh]Ai < c [w]a2 ■ (11.5) 

Call W := wRh. Using f lll.3p with appropriate h which almost gives a supremum and 
using the obvious fact Rh > h we write 



w 



(ny/^ = 2 j hwdx < J wRhdx = W{n) = W{{x : \Tf \ > 1}) . 
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Now use the estimate of Ai norm of W = wRH, namely, flll.5p . Then 

< 0(c[w]aJ / \f\wRhdx. 



Hence, 



But 



oo 



\\sth\\ 



w 



< 2. 



k=0 



Putting this into the previous inequahty we obtain < 20(c[w]yi2)||/||^ • Theorem 12.21 



All the results from above can be proved for Calderon-Zygmund operators on homogeneous 
metric spaces. 
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